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Abstract 

Genome-wide association studies liave identified a susceptibility variation MUCl rs4072037 for gastric cancer in Chinese 
population. Subsequent case-control studies have reported this association in other populations. However, the results 
remain controversial and ambiguous. The aim of this study is to provide a precise quantification for the association between 
MUCl rs4072037 variation and the risk of cancer. We performed pooled analysis of 10 case-control designed studies 
including 4,220 cases and 6,384 controls. Odds ratios (OR) and 95% confidence interval {95%CI) were calculated to assess 
strength of association in overall studies and in subgroup analysis stratified by ethnicity and cancer types. All statistical 
analyses were performed by Manager 5.0 and Stata 12.0 software. Overall, the MUCl rs4072037 polymorphism was 
associated with risk of cancer in all genetic models (G vs A: OR = 0.71, 95%CI: 0.63-0.80, p<0.01; GA vs AA: OR = 0.61, 
95%CI:0.55-0.67, p<0.01 ; GG vs AA: OR = 0.58, 95%CI: 0.47-0.71 , p<0.01 ; AG-hAA vs GG: OR = 0.60, 95%CI: 0.55-0.60, p<0.01 ; 
GG vs AG-hAA: OR = 0.70, 95%CI: 0.58-0.85, p<0.01). Further, subgroup analysis based on ethnicity suggested MUCl 
rs4072037 polymorphism had a subtly reduced cancer risk among Asian population, and stratified analysis by cancer types 
showed significantly decreased risk of gastric cancer in all genetic models. In conclusion, MUCl rs4072037 polymorphism 
may be used as potential biomarker for cancer susceptibility particularly for gastric cancer and for Asian population. 
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Introduction 

Cancer is the leading cause of death in economically developed 
countries and the second leading cause of death in developing 
countries. The global burden of cancer continues to increase 
largely because of the aging and growth of the world population 
alongside an increasing adoption of cancer-causing behaviors [1]. 
Based on the GLOBOCAN 2008 estimates, about 12.7 million 
cancer cases and 7.6 million cancer deaths are estimated to have 
occurred in 2008 [2]. Cancer arises due to the complex interplay 
of genetic and environmental risk factors, genetic predisposition 
has been considered to be a combined influence factor [3], the 
inheritance of the majority of cancers is polygenic [4]. 

In past decades, a large number of association studies were 
carried out in different cancers, but only few convincing low 
penetrance susceptible loci have been identified until now [5]. 
Recently, Genome-wide association studies (GWAS) play impor- 
tant roles in the identification of potential candidates for single 
nucleotide polymorphisms (SNPs). In 2010, Abnet et al. [6] 
conducted a GWAS on gastric cardia adenocarcinoma (GCA) and 
identified a statistically significant SNP of rs4072037 in mucinl 
(MUCl) gene, which located at lq22 is a synonymous SNP in the 
second exon of MUCl, pointing intricate role of MUCl in 
malignancy. 



The MUCl gene, encodes cell-surface glycoprotein with a 
complex cytoplasmic domain thought to be involved in signal 
transduction [7] , which is associated with attenuation of intracel- 
lular levels of reactive oxygen species (ROS), as well as epithehal 
cancers and with poor survival [8,9]. However, it remains 
unknown whether this locus is associated with an increased risk 
of all cancer types. Because GCA is a digestive system disease, it 
may manifest through different molecular mechanisms. In 
addition, the GWAS was carried out in a Han Chinese population. 
Therefore, we have extensively reviewed the literature and 
performed a meta-analysis based on all eligible published case- 
control studies to evaluate the association between MUCl 
rs4072037 polymorphism and cancer susceptibility. 

Materials and Methods 

Identification and Eligibility of Relevant Studies 

This meta-analysis was conducted by searching PubMed, 
Elsevier, Springer, and the Chinese Biomedical Literature 
database (CBM) for relevant published articles in English and 
Chinese (updated to December, 2013). Combination of the key 
words were as follows: 'MUCl polymorphism', 'MUCl 586 A/G', 
'rs4072037' and 'lq22 polymorphism cancer'. Eligible studies 
were examined carefully to determine whether they accorded with 
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synthesis 
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Studies included in quantitative 
synthesis (meta-analysis) 
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18 articles were excluded, due to: 
(n=8) Not association with MUCl 
(n=3) Letters, meta-analysis, reviews 
(n=7) Not case-control study 



11 articles excluded, due to: 
(n=6) Not explore rs4072037 
(n=2) Not explore cancer 
(n=3) Not explore polymorphisms 



2 articles excluded, due to: 
(n=l) Duplicate publication 
(n=l) Genotype data unavailable 



2 articles excluded, due to: 
(n=2) Not consistent with HWE 



Figure 1. Flow chart of literature search and study selection. 

doi:10.1371/journal.pone.0095651.g001 



the inclusion criteria for the meta-analysis. The studies included 
must meet the following criteria: a) about the rs4072037 
polymorphism and cancer risk, b) from a case-control designed 
study, c) sufficient pubhshed data for estimating an odds ratio (OR) 
with 95% confidence interval (CI), and d) genotype frequencies 
available. 

The studies, in which the genotype of controls for a certain 
polymorphism was not meet the assumptions for Hardy- Weinberg 
equilibrium (HWE), were excluded from the analysis. 

Data Extraction 

Two authors independently extracted data and entered them in 
a customized database. For conflicting evaluations, an agreement 
was reached after a discussion. For each eligible study, the 
following data were extracted: first author's name, publication 
year, country of origin, ethnicity, pathological type, study design, 
detecting sample, genotype method, numbers of genotyped cases 
and controls and HWE of controls, respectively. Ethnic descents 
were categorized as Asian or Caucasian. Study design was 
stratified into hospital-based or population-based studies. If 
original genotype frequency data were not available in relevant 
studies, a request for additional data was sent to the corresponding 
author. 



Statistical Analysis 

Statistical analyses were performed by Manager 5.0. The risks 
of cancer associated with rs4072037 polymorphism were calculat- 
ed direcdy from the data given in the eligible studies. ORs and 
corresponding to 95%CI were used to assess the strength of 
association between rs4072037 polymorphism and cancer risk. 
The pooled ORs were performed for allelic comparison (G vs A), 
heterozygote comparison (OA vs AA), homozygote comparison 
(GG vs AA), dominant model (AA+GA vs GG), and recessive 
model (AA vs GA+GG), respectively. Furthermore, the analyses 
were stratified by ethnicity (Asian, and Caucasian), cancer type (if 
only one cancer type contained fewer than two individual studies it 
was combined into the 'Other Cancers' group) and source of 
controls. 

We assessed the departure from the HWE for the control group 
in each study using Pearson's goodness-of-fit x test with 1 degree 
of freedom. 

Heterogeneity across studies was evaluated by Cochran's Q,test 
and f, which represents the percentage of total variation across 
studies that is attributable to heterogeneity. Heterogeneity was 
considered statistically significant when P<0.05 or f>50%, then 
the pooled OR estimate of each study was calculated by the 
random-effects model [10]. Otherwise, the fixed-effect model was 
used [1 1] . The significance of overall odds ratio (OR) was 
determined by the Z-test. If there were significant heterogeneity 
among included studies, the sources of heterogeneity would be 
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explored using meta regression in Stata version 12.0 (http:/ /www. 
stata.com). 

To assess the stability of the results, sensitivity was carried out by 
omitting each study in turn to assess the quality and consistency of 
the results. Publication bias was assessed using funnel plots, Begg's 
test (rank correlation test), and Egger's test (weighted linear 
regression test) for funnel plot symmetry [12]. These analyses were 
performed using Stata version 12.0 software. 

Statistical tests performed in the present study were considered 
significant whenever the corresponding null-hypothesis probability 
was P<0.05. 

Results 

Characteristics of the Studies 

Based on the search criteria and inclusion criteria, a total of 1 3 
studies available from 10 articles for MUCl rs4072037 polymor- 
phism were rnvestigated[13-21], the study selection process is 
shown in Figure 1, as summarized in Table 1, in which 3 studies 
(two articles) [15,21] genotype of controls for a certain polymor- 
phism were not consistent with HWE, were excluded from the 
analysis. Hence, a total of 10 studies including 4,220 cases and 
6,384 controls were used in the meta-analysis. All studies were 
case-control designed, including 3 cancer types, in which 8 studies 
on gastric cancer, and one each on colorectal cancer and breast 
cancer. There were 8 studies of Asian descendent and 2 of 
Caucasian descendent. A classic TaqMan assay was used in 4 out 
of 10 studies. 7 studies were randomly repeated a portion of 
samples as quality control while genotyping. 

Quantitative Synthesis 

The pooled analyses and the heterogeneity test are shown in 
Table 2 and Figure 2. A significantly decreased cancer risk was 
found in all genetic models, including the allelic genetic model (G 
vs A: OR = 0.71, 95%CI: 0.63-0.80, /<<0.01), heterozygous 
genetic model (GA vs AA: OR = 0.61, 95%CI:0.55-0.67, p< 
0.01), homozygous genetic model (GG vs AA: OR = 0.58, 95%CI: 
0.47-0.71, j6<0.01), the dominant model (AG+AA vs GG: 
OR = 0.60, 95%CI: 0.55-0.60, /)<0.01), and the recessive model 
(GG vs AG+AA: OR = 0.70, 95%CI: 0.58-0.85, ^<0.01). 

Further stratification analysis by ethnicity, the results showed 
that MUCl rs4072037 polymorphism was significantly linked to 
cancer risk (table 3). Overall, individuals carrying G allelic had a 
subtly reduced cancer risk among Asian population (G vs A: 
OR=0.69, 95%CI: 0.61-0.79, /)<0.01; GA vs AA: OR = 0.61, 
95%CI:0.55-0.67,/)<0.01; GG vs AA: OR=0.56, 95%CI: 0.43- 
0.73, /)<0.01; AG+AA vs GG: OR=0.60, 95%CI: 0.54-0.66, ^< 
0.01; GG vs AG+AA: OR = 0.64, 95%CI: 0.49-0.82, ^<0.01). In 
Caucasian population, MUCl rs4072037 polymorphism was 
significantiy associated with an decreased cancer risk in all genetic 
models except for homozygous genetic model and recessive model 
(G vs A: OR = 0.76, 95%CI: 0.64-0.90, p<Q.Q\\ GA vs AA: 
OR=0.61, 95%CI:0.46-0.80, /)<0.01; AG+AA vs GG: 
OR=0.58, 95%CI: 0.44-0.75, /)<0.01). 

Subgroup analysis was also stratified by cancer types, in 
different types of cancer, MUCl rs4072037 polymorphism was 
significandy associated with an decreased risk of gastric cancer in 
all genetic models (G vs A: OR = 0.68, 95%CI: 0.60-0.77, p< 
0.01; GA vs AA: OR = 0.59, 95%CI:0.53-0.66, p<0.0\; GG vs 
AA: OR = 0.52, 95%CI: 0.44-0.66, p<QM\ AG+AA vs GG: 
OR = 0.58, 95%CI: 0.53-0.64, p<Q.Q\; GG vs AG+AA: 
OR=0.65, 95%CI: 0.52-0.80, p<0.0\). No significant associa- 
tions were found in the "others" group (colorectal cancer and 
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Figure 2. Forest plot of cancer risk associated with MUC1 rs4072037 for Kiomozygous genetic model (GG vs AA). The squares and 
horizontal lines correspond to the study-specific OR and 95%CI. The area of the squares reflects the study specific weight. The diamond represents 
the pooled OR and 95%CI. 
doi:1 0.1 371 /journal.pone.0095651 .g002 



Egger's test was used to provide statistical evidence for funnel plot 
symmetry (Table 4). 

Discussion 

MUCl is an extracellular protein anchored to the epithelial 
surface and engage in morphogenetic signal transduction [22]. 
Rs4072037 was observed to disrupt the physiological functions of 
MUCl [19], which might due to alternative splicing of the 5'- 
region of exon 2 controlled by rs4072037 and ultimately result in 
failure to the physiological protection of gastric mucosa [23]. 

GWAS was referred to as an unbiased and fairly comprehensive 
approach to explore the genetic variation related to cancer. Abnet 
et al. [6] reported the strong association of the new and novel low- 
penetrance susceptibility locus rs4072037 with the decreased risk 
of gastric cancer in the Chinese population by a large-scale 
GWAS. Replications of the association identified in GWAS by 
other independent studies were performed, but the study 
populations were not all Chinese, and the results were conflicting 
and inconclusive because of various reasons, such as different 
ethnicities, resident areas, sample size, and environmental factors. 
Meta-analysis, as an important statistical method has more 
statistical power than a single study, which can quantitatively 
combine analyses from different studies. To provide a more 
comprehensive analysis on the association, we producted this 
meta-analysis based on ten case-control designed studies with 
10,568 participants. 

In the present study, we firstly analyzed the association of 
MUCl rs4072037 of cancer in the overall cancer types. The results 
of our meta-analysis indicated that the G allele of MUCl 
rs4072037 variation was significandy associated with decreased 
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breast cancer) except the dominant model (AG+AA vs GG: 
OR = 0.72, 95%CI: 0.55-0.95, /)<0.01) (Table 3, Figure 3). 

When studies were segregated based on source of control, 
MUCl rs4072037 polymorphism was found to be associated with 
decreased cancer risk, the results were equal to subgroup analysis 
of cancer subtypes (Table 3). 

Test of Heterogeneity 

When evaluating the association between the MUCl rs4072037 
polymorphism and the susceptibility to cancer, we found that there 
was significant heterogeneity for the allelic comparison (G vs A: 

-Pheterogeneity= 0.01, 7^ = 63%). TllUS, WC aSSCSScd the SOUrCC of 

heterogeneity for the allelic genetic model comparison, meta 
regression in Stata 12.0 was used to assess by cancer type, matched 
controls (yes or not), source of control (population or hospital), 
assay, sample size (500 as the boundary) and quality control (with 
or not). It was detected that the systemic results were not affected 
by these characteristics (data not shown). 

Sensitivity Analysis 

Sensitivity analyses were performed to assess the influence of 
each individual study on the pooled ORs by the systematic 
omission of the individual study from the analyses. The 
corresponding pooled ORs were not materially altered, indicating 
that our results were statistically robust (data not shown). 

Publication Bias 

Begg's funnel plot and Egger's linear regression test were 
performed to assess the publication biases of included studies. The 
shapes of the funnel plots did not reveal any evidence of obvious 
asymmetry under the dominant model (Figure 4). Then, the 

Table 2. Main results of pooled ORs in the meta-analysis. 
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Figure 3. Forest plot of cancer risk associated with MUCl rs4072037 for the homozygous genetic model (GG vs AA) stratified by 
cancer types. The squares and horizontal lines correspond to the study-specific OR and 95%CI. The area of the squares reflects the study specific 
weight. The diamond represents the pooled OR and 95%CI. 
doi:10.1371/journal.pone.0095651.g003 
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Figure 4. Funnel plot of MUCl rs4072037 polymorphism and cancer risk for homozygous genetic model (GG vs AA). 

doi:10.1371/journal.pone.0095651.g004 



risk of cancer. In addition, subgroup analyses revealed that MUCl 
rs4072037 G allele was associated with a decreased risk for 
progress of gastric cancer,especiaUy in Asians. It indicated that a 
possible ethnic difference in the genetic background. Moreover, 
similar results have been obtained by Zheng et al [24] . The results 
suggested that MUCl rs4072037 polymorphism might be involved 
in gastric carcinogenesis and tumor difiFerentiation among Asian 



populations. The results may be important for interpretation of 
genetic variance for susceptibility to gastric cancer and facilitate to 
be used as diagnostic markers in cancer, and are under 
investigation as therapeutic targets for cancer. 

Although meta-analysis is robust, our meta-analysis had some 
limitations common to these types of studies. Firstly, because only 
published studies were retrieved in the meta-analysis, publication 



Table 4. Publication bias of MUCl rs4072037 for Egger's test. 
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bias might be possible, even though the statistical test did not show 
it. Secondly, our meta-analysis did not evaluate any potential 
gene-gene interaction and gene-environment interaction due to 
lack of relevant published data. Thirdly, although perfect 
searching strategy was used to identify eligible studies for current 
meta-analysis, it was stiU possible that a few studies meeting 
inclusion criteria were not included. 

In summary, this meta-analysis suggested that the MUCl 
rs4072037 polymorphism was significantly associated with de- 
creased risk of cancer, especially for gastric cancer and Asian 
population. However, further well-designed studies in large cohort 
of different ethnic origins and cancer types are needed before the 
application of MUCl rs4072037 polymorphism as cancer 
biomarker in clinical settings. 
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